Distributing audio signals for an audio/video presentation

ABSTRACT

A mobile communication device and an “app” for a mobile communication device for synchronizing audio with a delayed video displayed on a display device, including (a) receiving an audio signal over a wireless communication system, the audio signal containing audio information corresponding to delayed video information displayed on a display device, wherein the audio information is received at the mobile communication device prior to the corresponding delayed video information being displayed on the display device; (b) providing a user interface to the user including an audio delay control allowing the user to control an amount of delay added to the audio information for synchronizing output of the audio information with the delayed video information displayed on the display device; and (c) outputting the audio information via an audio interface of the mobile communication device delayed by the amount of delay added via the audio delay control.

CROSS-REFERENCE TO RELATED APPLICATION(S)

This patent application is a continuation-in-part of, and therefore claims priority from, U.S. patent application Ser. No. 13/803,876 filed Mar. 14, 2013, which is a continuation-in-part of, and therefore claims priority from, U.S. patent application Ser. No. 13/734,382 filed Jan. 4, 2013, which claims the benefit of U.S. Provisional Patent Application No. 61/583,758 filed Jan. 6, 2012 and which also is a continuation-in-part of, and therefore claims priority from, U.S. patent application Ser. No. 12/971,559 filed Dec. 17, 2010, which claims the benefit of the following United States Provisional Patent Applications: U.S. Provisional Patent Application No. 61/390,352 filed Oct. 6, 2010; U.S. Provisional Patent Application No. 61/385,997 filed Sep. 24, 2010; and U.S. Provisional Patent Application No. 61/288,103 filed on Dec. 18, 2009. This patent application also claims the benefit of the following United States Provisional Patent Applications: U.S. Provisional Patent Application No. 62/045,879 filed Sep. 4, 2014, 62/046,480 filed Sep. 5, 2014, 62/049,675 filed Sep. 12, 2014, and 62/049,752 filed Sep. 12, 2014. Each of the above-referenced patent applications is hereby incorporated herein by reference in its entirety.

TECHNICAL FIELD

The present invention relates to synchronizing audio signals on one device with video signals on the same or another device.

BACKGROUND ART

It is known in the prior art to provide multiple television displays in a public setting so that viewers can choose to watch any one of multiple television programs. Sometimes these displays broadcast without any audio or include audio that is difficult to hear in the display's environment. In some settings, these displays may include subtitles, but subtitles may prove difficult to keep up with or may be disharmonious with the displayed video.

It is also known in the prior art to broadcast television and radio simultaneously, in a process often referred to as simulcasting or simultaneous broadcasting. This method of broadcasting, used primarily before TV sound transmission was possible, includes transmitting the audio or sound portion of a television broadcast to a radio, while transmitting the corresponding video portion to a television.

It is also known in the prior art to transmit sound to a device separate from a television display. Hospitals often accommodate such transmissions, for example through an audio system on patient's bed.

While technological advances in television broadcasts have significantly reduced the use of broadcasting techniques such as simulcasting, some of these advances have introduced new challenges and invited opportunity for additional innovations in the wake of social television broadcasting's increasing popularity.

SUMMARY OF VARIOUS EMBODIMENTS

In accordance with one embodiment, a computer program product comprises a tangible, non-transitory computer-readable medium having embodied therein computer program instructions that, when run on a mobile communication device of a user, cause the mobile communication device to execute a method for controlling audio associated with a delayed video displayed on a display device. The method involves (a) receiving an audio signal over a wireless communication system, the audio signal containing audio information corresponding to delayed video information displayed on a display screen of a display device, wherein the audio information is received at the mobile communication device prior to the corresponding delayed video information being displayed on the display screen of the display device; (b) providing a user interface to the user including an audio delay control allowing the user to adjust an amount of delay to be introduced by the mobile communication device when outputting the audio information for synchronizing output of the audio information with the delayed video information displayed on the display screen of the display device; and (c) outputting, with such amount of introduced delay, the audio information via an audio interface of the mobile communication device.

In accordance with another embodiment, a mobile communication device comprises a wireless communication interface; an audio interface; and an audio processor application for controlling audio associated with a delayed video displayed on a display device, the audio processor application configured to (a) receive an audio signal over the wireless communication interface, the audio signal containing audio information corresponding to delayed video information displayed on a display screen of a display device, wherein the audio information is received at the mobile communication device prior to the corresponding delayed video information being displayed on the display screen of the display device; (b) provide a user interface to the user including an audio delay control allowing the user to adjust an amount of delay to be introduced by the mobile communication device when outputting the audio information for synchronizing output of the audio information with the delayed video information displayed on the display screen of the display device; and (c) output, with such amount of introduced delay, the audio information via an audio interface of the mobile communication device.

In various alternative embodiments, the audio delay control may allow the user to incrementally adjust the amount of delay. The display device may be separate from the mobile communication device or may be integral to the mobile communication device, which case delayed video information may be received over the wireless communication system and may be displayed on the integral display device. Advertisements may be received over the wireless communication system and such advertisements may be displayed on a display device of the mobile communication device. Such advertisements may be selected based on such things as a location of the mobile device; an identity of the mobile device; an identity of a user of the mobile device; the video information being displayed on the display screen of the display device; or the audio information.

In certain embodiments, an image of at least a portion of the display screen may be captured using an image capture device of the mobile communication device and information derived from the captured image may be transmitted to a server for identifying an audio channel associated with the image from among a plurality of audio channels, wherein the audio information received by the mobile communication device is associated with the identified audio channel.

In certain embodiments, an image associated with a display device from among a plurality of display devices may be captured using an image capture device of the mobile communication device and information derived from the captured image may be transmitted to a server for identifying the display device, wherein the audio information received by the mobile communication device is associated with the identified display device.

In certain embodiments, an amount of delay to add to the audio information for synchronizing the audio information with the delayed video information displayed on the display screen of the display device may be automatically determined, wherein the total amount of delay added to the audio information is based on such automatically determined amount of delay and any adjustment made by the user via the audio delay control.

In accordance with another embodiment, a computer program product comprises a tangible, non-transitory computer-readable medium having embodied therein computer program instructions that, when run on a mobile communication device of a user, cause the mobile communication device to execute a method for controlling audio associated with a video displayed on a display screen of a display device. The method comprises providing a user interface to the user including an image capture control allowing the user to capture an image of at least a portion of the display screen using an image capture device of the mobile communication device; and transmitting, to a server, information derived from the captured image for identifying an audio channel associated with the image from among a plurality of audio channels, wherein the audio information received by the mobile communication device is associated with the identified audio channel.

In accordance with another embodiment, a mobile communication device comprises a wireless communication interface; an image capture device; and an audio processor application for controlling audio associated with a video displayed on a display device, the audio processor application configured to provide a user interface to the user including an image capture control allowing the user to capture an image of at least a portion of the display screen using the image capture device; and transmit, to a server, information derived from the captured image for identifying an audio channel associated with the image from among a plurality of audio channels, wherein the audio information received by the mobile communication device is associated with the identified audio channel.

In accordance with another embodiment, a computer program product comprising a tangible, non-transitory computer-readable medium having embodied therein computer program instructions that, when run on a mobile communication device of a user, cause the mobile communication device to execute a method for controlling audio associated with a video displayed on a display screen of a display device. The method involve providing a user interface to the user including an image capture control allowing the user to capture an image associated with a display device from among a plurality of display devices using an image capture device of the mobile communication device; and transmitting, to a server, information derived from the captured image for identifying the display device, wherein the audio information received by the mobile communication device is associated with the identified display device.

In accordance with another embodiment, a mobile communication device comprises a wireless communication interface; an image capture device; and an audio processor application for controlling audio associated with a video displayed on a display device, the audio processor application configured to provide a user interface to the user including an image capture control allowing the user to capture an image associated with a display device from among a plurality of display devices using the image capture device; and transmit, to a server, information derived from the captured image for identifying the display device, wherein the audio information received by the mobile communication device is associated with the identified display device.

In any of the preceding four embodiments, an audio signal may be received over a wireless communication system, the audio signal containing audio information corresponding to delayed video information displayed on a display screen of a display device, wherein the audio information is received at the mobile communication device prior to the corresponding delayed video information being displayed on the display screen of the display device; a user interface may be provided to the user including an audio delay control allowing the user to adjust an amount of delay to be introduced by the mobile communication device or audio processor application when outputting the audio information for synchronizing output of the audio information with the delayed video information displayed on the display screen of the display device; and the audio information may be output with such amount of introduced delay via an audio interface of the mobile communication device

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing features of the invention will be more readily understood by reference to the following detailed description, taken with reference to the accompanying drawings, in which:

FIG. 1 schematically shows a system in which an audio/video processor distributes audio and video information associated with an audio/video signal, in accordance with various embodiments of the invention;

FIG. 2 schematically shows a system in which the audio/video source distributes audio and video information associated with an audio/video signal, in accordance with various embodiments of the invention;

FIG. 3 schematically shows a flow diagram demonstrating processing of a television signal in accordance with various embodiments of the invention;

FIG. 4 schematically shows a flow diagram for synchronizing the audio provided to a mobile device with a video signal displayed on a display device in accordance with various embodiments of the invention;

FIG. 5 schematically shows a system in which processing and distribution of audio and video information associated with an audio/video signal is divided between local and remote audio/video processors, in accordance with various embodiments of the invention;

FIG. 6 schematically shows a system that supports direct-from-provider audio/video synchronization, in accordance with various embodiments of the invention;

FIG. 7 schematically shows a system in which the mobile device receives audio from a source different than the source of the video displayed on the display device, in accordance with various embodiments of the invention;

FIG. 8 shows an exemplary user interface screen in accordance with such an embodiment;

FIG. 9 shows an exemplary user interface screen having the separate digits 902 from which the user can enter the desired channel number;

FIG. 10 shows a variation of FIG. 9 in which the selected channel number 1002 is displayed;

FIG. 11 shows an exemplary user interface screen having an image window, a zoom control, and a “select” button for capturing an image in order to identify the desired television/channel;

FIG. 12 shows a representation of a portion of a television screen containing a pattern that can be used to identify the desired television/channel; and

FIG. 13 is a schematic diagram showing an audio processor in accordance with one exemplary embodiment of Synchronized Personal Sound System for Live Presentations.

It should be noted that the foregoing figures and the elements depicted therein are not necessarily drawn to consistent scale or to any scale. Unless the context otherwise suggests, like elements are indicated by like numerals.

DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS

Definitions. As used in this description and the accompanying claims, the following terms shall have the meanings indicated, unless the context otherwise requires:

An “A/V signal” (also sometimes referred to as a “television signal” in certain contexts) is generally an information signal that contains both video and audio content and perhaps other information (e.g., closed captioning information). An A/V signal may be available from any of a variety of sources, such as various sources described below. An A/V signal may include a television broadcast/program, movie, multimedia computer file (e.g., JPEG, MPEG), streaming media, or other audio/video information. An A/V signal may be transmitted over a communication system (e.g., broadcast television, cable television, satellite television, computer network such as the Internet, etc.). An A/V signal may be stored for later playback (e.g., on a DVD, computer memory, digital video recorder, TV streaming device, etc.). An A/V signal may be available “live” (e.g., a television broadcast, streaming video, etc.) or may be timeshifted and/or placeshifted.

A “mobile device” is a communication device that receives certain signals over a communication network in embodiments of the present invention. A mobile device may be, for example, a cellular phone, smartphone (e.g., iPhone™), personal digital assistant (e.g., Blackberry™), tablet computer, laptop or personal computer, wireless headset (e.g., using an infrared or radio frequency communication link), or other device. Unless the context requires otherwise, references to a specific type of device in exemplary embodiments discussed below (e.g., a phone or smartphone) are not intended to limit an aspect of the present invention to that particular type of device. A mobile device may receive signals over a wireless network (e.g., WiFi, cellular telephone, Bluetooth, etc.), the Internet, and/or other communication network. A mobile device is not necessarily mobile in the sense that it can be stationary at a given time.

A “TV streaming device” is a device that transmits A/V signals (e.g., television programs) over a communication network such as the Internet, typically for timeshifting and/or placeshifting. Examples of TV streaming devices include the SLINGBOX™ TV streaming device from Sling Media and the LOCATIONFREE PLAYER™ TV streaming device from Sony. Such devices typically allow a remote device to receive a television signal over an existing Internet connection, e.g., from a desktop or laptop computer, smartphone, PDA, or television (via an interface device). Such devices typically also allow the mobile device to control an external television signal source such as a cable/satellite box, e.g., to change channels, control DVR functions, etc.

An “A/V signal source” is a source of a video or A/V signal. Examples of A/V signal sources include a terrestrial television broadcast received via a television antenna, analog cable, a digital cable box, a satellite receiver, a digital video recorder (DVR), a DVD player, a Video Cassette Recorder (VCR), a computer (e.g., a server accessible over the Internet or other communication system), a video game console, a video camera or camera phone, a TV streaming device, or other A/V signal source. Sources also may include any of a variety of on-line or off-line, public or private providers, such as, for example, AppleTV, Hulu, Roku, Xbox, Netflix, Amazon video on demand, D-Link, digital signage content providers, etc. that provide video, audio, or audio/video content.

A “display device” is a device that displays video. A display device may be a television, monitor, computer, projector, kiosk, billboard, digital signage, screen, or other type of display device. A display device may play audio, display text (e.g., closed captioning), and/or perform other functions. Unless the context requires otherwise, references to a specific type of display device in exemplary embodiments discussed below (e.g., a television) are not intended to limit an aspect of the present invention to that particular type of device.

In illustrative embodiments of the present invention, the audio portion of an A/V signal is sent to one or more mobile devices and the video portion of the A/V signal is sent to one or more separate display devices. The mobile device(s) may receive the audio portion over a wireless network (e.g., via WiFi, cellular, wireless data network) or other communication network (e.g., the Internet), which inherently adds delay. Consequently, a simultaneous transmission of an unaltered video signal to the display device(s) and the audio signal to the mobile device(s) would result in the audio portion arriving at the mobile device(s) later than the video arrives at the display device(s). The uncoordinated arrival of the video and the audio would result in an incoherent viewing experience in which the audio rendered by the mobile device(s) lags behind the video displayed on the display device.

Accordingly, in embodiments of the present invention, the signal provided to the display device contains video information that is delayed by a predetermined amount greater than the expected amount of delay in the communication network over which the audio information is sent so that the audio is received by the mobile device(s) at or before the video is displayed on the display device. For example, in some embodiments, the expected delay in the communication network may be 5-6 seconds, and therefore the video information may be delayed, for example, by 8-10 seconds before being displayed so that the audio is received by the mobile device(s) prior to display of the video and can be further delayed by the mobile device to synchronize the audio with the video. The signal provided to the display device may include the audio information and/or other information. The amount by which the information provided to the display device (i.e., the video and optionally the corresponding audio and/or other information) is delayed may be fixed or variable/configurable. An application running in the mobile device provides for adding additional delay to the received audio information so that the audio can be synchronized to the corresponding video on the display device (typically, the amount of delay is continuously variable from zero to some predetermined maximum in some predetermined increment, such as in millisecond increments). The application may allow the user to manually add the delay to the audio, e.g., using various types of controls, and/or the application may automatically add delay to the audio pursuant to various synchronization protocols.

Among other things, such audio forwarding with video delay allows one or more users to listen to the audio portion of an A/V signal through his or her respective mobile device, with each user being able to control the delay added to the audio by the mobile device, volume/muting of the audio, and other parameters. Such individualized rendering and control of the audio may be useful in many contexts, such as when video is displayed without audio in locations such as sports bars, restaurants, recreational facilities, shopping centers, airports, and other locations or generally when private or individualized rendering of the audio associated with an A/V signal is desired. Embodiments of the present invention generally provide value-add to advertisers and content providers, who benefit by having users listening to commercials when they otherwise might be looking away from the display device.

Audio Forwarding with Video Delay by Separate Processing Unit

FIG. 1 shows a schematic of a system including a processing unit 102 that performs audio forwarding with video delay, in accordance with various embodiments of the invention. As discussed further below, audio forwarding with video delay schemes in accordance with embodiments of the present invention typically allow audio and/or other information to be provided along with the delayed video to the display device (such information is typically delayed so as to be in synchronization with the delayed video) and may additionally or alternatively allow video and/or other information to be forwarded without delay along with the audio to the mobile device(s). Among other things, the processing unit 102 includes an interface to one or more display devices 103 (e.g., HDMI, component, composite, S-video, DVI, coaxial, wireless, or other interface) and an interface for communicating with one or more mobile devices 105 (in this example, an Ethernet interface for connecting to a wireless router or other communication device, although the processing unit may include an integral wireless router, cable modem, or other communication/networking device).

The processing unit 102 may obtain A/V signals from external and/or internal sources. In FIG. 1, unit 102 includes an interface to an external A/V signal source 101 (in this case, a cable or satellite set top box that may include an analog or digital television tuner, an internal DVR, and/or a modem for video-on-demand and other digital services, coupled to the unit 102, for example, via HDMI, component, composite, S-video, DVI, coaxial, or other interface). Additionally or alternatively, unit 102 may include one or more internal A/V sources, such as an analog or digital tuner, an integrated DVR, an integrated Internet video streaming device, or other type of device.

Among other things, unit 102 includes appropriate hardware and/or software for sending at least the audio portion of a television signal to one or more mobile devices 105 over a communication network (e.g., a wireless network or the Internet) and sending at least the corresponding video portion delayed to one or more display devices. Mobile device 105 may be a smart phone with an application that delays audio for fine tuning. The user may control the variable delay. The user may connect to a server to activate the audio. Advertisements may be displayed on the mobile device screen.

In some embodiments, unit 102 may transmit essentially the same audio and/or video information to both the mobile device 105 and the display device 103, but with the signal provided to the display device 103 delayed. For example, unit 102 may obtain packets of digitized audio/video information (e.g., from an external or internal source), forward the packets to the mobile device 105 over the communication network, and after the predetermined amount of delay, forward the packets to the display device or decode the packets and send decoded information to the display device.

More typically, however, unit 102 decodes the A/V signal information to recover at least the audio portion and transmits the audio portion to the mobile device 105 over the communication network. This may involve, for example, decoding the television signal to recover the analog audio portion, digitizing the analog audio portion using an analog-to-digital converter, and encoding the digitized audio for transmission over the communication network and subsequent receipt of the digitized audio by the mobile device(s). The message may be encoded in any format suitable for receipt by the mobile device.

In FIG. 1, unit 102 sends a signal 107 containing the audio information to a mobile device 105 over a wireless network via a wireless router 104 connected to the unit 102 via an Ethernet port or wireless bridge, although it should be noted that the wireless router may be integral to the unit 102. It also should be noted that unit 102 additionally or alternatively may send the signal containing the audio information over the Internet or other communication network. Unit 102 also sends a signal 106 containing delayed video information to the display device 103. The signal 106 sent to the display device 103 typically, but not necessarily, includes the corresponding audio information delayed in synchronization with the video, e.g., so that the audio can be played from the display device if desired and may include other video elements inserted into the video stream. As discussed above, the video information provided to the display device is delayed by a predetermined amount (which may be configurable) so that the audio is received by the mobile device 105 at or before the video is displayed on the display device.

Thus, processing an A/V signal may include adding a delay to the video component of the A/V signal, e.g., via the software, before the video component is routed to one or more other display devices 103. For display devices that receive both a video and audio component of the A/V signal, the audio component of the A/V signal being sent to that display device may also be delayed in conjunction with the video. The amount of delay may be fixed or variable/configurable and may be determined as a function of the delay incurred between the transmission of the audio signal to a mobile device 105 and the actual arrival of that signal at the mobile device 105.

Unit 102 may be configured to modify and/or supplement the video and/or audio portions provided to the display device 103 and/or the mobile device 105. For example, unit 102 may be configured to add additional information such as a video advertisement to the A/V signal being transmitted to the display device 103. The advertisements may be in a dynamic video or still image format. These advertisements may be added to the video in a variety of display formats, including but not limited to, a horizontal or vertical split screen view, a border, a banner, or any other format specified by the content provider. The advertisements may be downloaded from a server and may be obtained from a third party source such as, for example, an online advertiser.

Similarly, unit 102 may add a device identifier to the signal provided to the display device so that users can identify different display devices on which different video is being displayed. An application running on the mobile device (discuss in more detail below) may allow the user to selectively receive the audio associated with a particular display device based on the device identifier. Audio information associated with different display devices may be provided on different physical or logical communication channels (e.g., different wireless frequencies, different logical connections, etc.).

In various alternative embodiments, unit 102 may be implemented as a stand-alone unit or may be integrated with a cable/satellite set top box, DVR, TV streaming device, computer, or other type of device or combination of devices. Thus, the audio forwarding with video delay functionality may be integrated into set top boxes, DVRs, TV streaming devices, computers, and other types of devices. When integrated with a DVR, TV streaming device or another existing device, such as those listed above, existing hardware components within these devices, such as a modem, a decoder, a television tuner, etc. may be leveraged and used to implement functions associated with some embodiments of the present invention. For example, when integrated with cable/satellite set top box functionality, the existing tuner of the cable/satellite set top box may be used to receive and decode A/V signals, and an existing modem may be used to communicate audio and other information to one or more mobile devices. Set top boxes often include DVR and/or video-on-demand functionality that can be used as the source of A/V signals. When integrated with TV streaming device functionality, the existing device capabilities may be used to receive A/V signals from external sources, record A/V signals, playback A/V signals on the display device, and communicate with one or more mobile devices using existing Internet capabilities that allow the mobile device to access the unit 102, receive audio and other information, and remotely control the unit 102 and external devices.

Logic Flow for Audio Forwarding with Video Delay

FIG. 3 shows a flow diagram demonstrating processing of an A/V signal in accordance with various embodiments of the invention. The process begins with receipt of the audio and video signal at step 301. The audio and video signal may be components of a television signal transmitted from a satellite or cable service provider. Once the signal(s) are received, the signal may be decoded and components of it modified in step 302 by a signal processor. Modification of the signal may include adding a delay to the video portion of the signal. The amount of the delay will be specified to accommodate the perceived delay in the audio transmission to a mobile device. Modification of the signal may include adding other media, such as advertisements to the video portion of the signal. Modification may also include adding delay to the audio portion of the signal being sent with the video portion of the signal. In step 303, the delayed video and audio signal may be transmitted to a public display device with or without additional advertisements. In step 304, the non-delayed audio signal may be transmitted to a mobile device, for example, a smart phone, via a wireless router. The non-delayed audio signal sent to the mobile device is distinct from the delayed audio signal sent to the display device by virtue of it being sent without a delay being actively added to it. As discussed above, the non-delayed audio referred to throughout the application represents audio that is sent through a communications network to a mobile device without a delay being purposefully added to it; however, this audio will inherently experience a delay due to the transmission of the audio over the network.

Audio Forwarding with Video Delay by Content Provider

In an alternative embodiment, rather than having a separate processing unit perform the audio forwarding with video delay functionality, a content provider may perform its own audio forwarding with video delay. For example, as shown in FIG. 2, a content provider 201, 202 such as a satellite provider, a cable provider, or a terrestrial broadcaster or other program source, may transmit the audio portion 204 of an A/V signal without delay to one or more mobile devices 105 over first communication network 203 (e.g., a phone sound service provider such as an independent provider or the content provider itself through an appropriate license arrangement) such as a cellular telephone network or the Internet and transmit the A/V signal 205 itself with delay over its distribution system, e.g., for reception by cable/satellite box 101 at the viewing location. The delay could originate at the source or by an A/V processor at the destination, e.g., a box provided by the cable/sat provider. As discussed above, the delay added to the signal provided to display device 103 must account for the delay in transmission of the audio to mobile device 105. The audio signal transmitted to device 105 may include any additional content desired, such as an advertisement. In this embodiment, the mobile device 105 still includes an application, which may be used to add any additional delay required to improve the synchronization of the audio signal with the video signal displayed on device 103. The mobile device user may log on with the phone sound service provider 203. The mobile device user may adjust delay to synchronize the audio. Advertisements may be displayed on the mobile device screen. Use and user data may be compiled by the service provider.

Mobile Device Application Software

Whether receiving audio information from unit 102 or directly from a content provider, the mobile device 105 typically includes a software application that allows the device to access and control the audio, including adding a continuously variable amount of delay to the audio. The application may allow for connection to the unit 102 or other provider, similar to the way the SLINGPLAYER™ software provided by Sling Media allows mobile devices to access a SLINGBOX™ over the Internet. The application may provide a slider bar or other control for manually adjusting the amount of delay added to the audio. Additionally or alternatively, the incremental delay may be implemented automatically by the application, e.g., using a synchronization protocol. The application may allow for user control of other functions, e.g., volume/muting control of the audio. The application may also allow the user to pick from a variety of audio sources coordinated with display devices based on the user's location, e.g., by specifying the identifier associated with a display device at the user's location. The user's location may be implicit, e.g., by virtue of being within range of an identifiable wireless transceiver or processing unit, or explicit, e.g., using GPS or other location identifying means, to determine the user's location and hence available display devices. Accordingly, each mobile device receiving the audio signal may add incremental delay to the audio signal to provide device specific or network specific delay times.

In some embodiments, the application may be configured to display advertisements or other information on the screen of the mobile device during transmission of the audio to the mobile device. This information may be provided along with the audio or may be provided separately from the audio. Advertisements may be displayed while the application is in the process of accessing the audio information and/or during playback of the audio information. Advertisements to be displayed on a particular mobile device may be selected, for example, based on a location of the mobile device (e.g., based on, for example, the location of the unit 102, GPS information provided by the mobile device, or other location identifying information), the identity of the mobile device (e.g., based on the phone number, serial number, or communication address of the device), the identity of the user (e.g., based on a login name used to access the audio information), the content being displayed on the display device, the type/nature of the location (e.g., bar, airport, department store, etc.), or other information.

Information about the user(s) and/or the mobile device(s) that are receiving the audio information may be collected and provided to marketing and other entities. Such information may include, for example, the location, the A/V signal or channel being watched, the number of users watching an A/V signal or channel, and other information.

Exemplary Synchronization Protocol

FIG. 4 shows a flow diagram for synchronizing audio provided to a mobile device with a video signal on a display device in accordance with various embodiments of the invention. The flow diagram depicted in FIG. 4 shows how an application running on the mobile device or accessible by the mobile device may be configured for automatically adding delay to the received audio information by the application running in the mobile device. It should be noted that automatically adding delay is optional and may be used in lieu of, or in addition to, manual delay controls. Thus, for example, the application may support only manual delay adjustment, only automatic delay adjustment, or both automatic and manual delay adjustments.

In step 401, the application connects the device with a service provider capable of sending the audio signal to the mobile device. In step 402, the application selects a video source, which represents the video display with which the audio provided to the mobile device will be synchronized. The application analyzes the delay added to the video transmitted by the video source in step 403. The application analyzes the audio delay incurred by the wireless transmission to the mobile device in step 404. The requirements may depend on the device and may depend on the network's available rate of data transfer. In step 405, the delay input into the video is compared with the delay incurred during the audio transmission. In step 406, broadcast of the audio on the mobile device is adjusted to improve the accuracy of the mobile audio synchronization with the video display.

Alternative Processing for when the A/V Source is Changed

In some embodiments of the present invention, when the A/V source is changed (e.g., the channel is changed), it is desirable to immediately show on the video display that the A/V source has changed (e.g., by displaying some video content from the new A/V source), as opposed to delaying the video for the extended time described above (the audio for the new A/V source can be forwarded to the mobile device without delay). Of course, at some point, the video needs to be delayed as discussed above so that the mobile device(s) receive the audio before the corresponding video display. Various embodiments of the present invention may engage one of numerous options to display video information during this time lapse after the A/V source is changed. For, example the processing unit may cause display of a still shot during this programming interruption. The still shot may display a message, for example indicating that the new channel will be displayed in a few moments. Alternatively, the still shot may simply include a blank screen, such as a blue screen. In other embodiments, the still shot could display an advertisement. Additionally or alternatively, a progress indicator (e.g., a progress bar or countdown) may be displayed to give viewers an indication of when the video from the selected A/V source will be displayed.

Other embodiments of the present invention may accommodate this lapse by displaying an alternative video. The alternative video may also be an advertisement. The source of the alternative video may be a direct and non-delayed signal from the video signal source. In embodiments where the source of the video are a direct and non-delayed videos from the signal source, the display device would alternate from receiving the delayed video to receiving a non-delayed video and then switch back to a delayed video associated with the new channel.

In exemplary embodiments, if any button on the television remote control is pressed, the circuitry in the A/V processor unit detects activation of the remote control and immediately switches to the real time source by routing the A/V signal around the delay circuit and connecting it directly to the television. After a predetermined time has passed, for example 15 seconds, the processor unit may discontinue routing the non-delayed signal to the television and resume engaging the delay circuit to route the delayed A/V signal to the television. Each activation of the remote control may bypasses the delay circuit for the predetermined time.

In the event that there are multiple televisions in the same area, each A/V processor box may include a circuit capable of (e.g., at initial setup) learning the identifier codes transmitted by the TV remote control that belongs to the TV attached to that box. Therefore, each A/V processor only acts on the remote control learned by that box.

Mobile Device as an Input Device

Other embodiments of the present invention include the use of a mobile device, such as a smart phone, as an input device for a display device. The system and smart phone application permit the phones or devices equipped with the application to be an input device to interact with the content on the public display device—for example, to respond to questionnaires, enter contests, or request sending of product information to the smart phone owner's e-mail address, which is already programmed into the phone. Requested information may even be immediately downloaded to the smart phone for later retrieval.

Mobile Device as Television Remote Control

Other embodiments of the present invention include the use of a mobile device, such as a smart phone, as a controller for a display device. Embodiments of the present invention may be applied to public display devices, including televisions with private programming (usually local promotional content), such as televisions mounted on ceilings in malls and retail stores. Certain embodiments of the invention may also apply to public display devices that are meant for a single person viewing or private viewing, e.g., kiosks, personalized content displayed on a public display device, etc. Embodiments of the present invention permit audio corresponding to the display provided on these televisions through mobile devices, such as smart phones.

In these private programming situations, smart phone users may change the program content (for example, choose a product area of interest, or request floor locations of products) by using the smart phone as a remote control to control the display content of the television. For example, the user may select the content from a menu or press specialized buttons that appear on the smart phone screen. In these situations, the user's smart phone screen may change to align with the television presentation. Additionally, requested information may be downloaded to the smart phone for immediate retrieval. All of this is achievable while listening to audio corresponding to the video displayed on the display device.

Offloading Audio/Video Processing

In the embodiment shown and described above with reference to FIG. 1, the A/V processor 102 essentially performs all of the processing of the audio/video signal and distribution of the audio and video information, including adding delay to the video information. FIG. 5 schematically shows a system in which processing and distribution of audio and video information associated with an audio/video signal is divided between local and remote audio/video processors, in accordance with various embodiments of the invention. Here, the local A/V processor 502 (e.g., located in a bar or other establishment near the display device) sends the audio/video signal to an external A/V processor 506, e.g., over the Internet or other network via an Ethernet or WiFi connection 507. The external processor 506 may process audio/video signals for multiple processors 502 situated at the same location (e.g., multiple local processors 502 at a sports bar) and/or at multiple locations (e.g., a central server utilized by local processors 502 situated at different establishments). The external A/V processor 506 returns processed audio and video information to the A/V processor 502 via 507, which then distributes the audio information to the mobile device(s), in this example, via router 104, and distributes the video information 508 to the display device(s). Among other things, some expected advantages of such a distributed processing architecture include simplifying (and hence reducing the cost of) the local processor 502 while the servers that perform the intensive audio/video processing (particularly for HDTV signals) can reside at a central, managed location.

The external A/V processor 506 may return the audio and video information in a variety of forms. For example, the external A/V processor 506 may return undelayed audio information (optionally with undelayed video information) along with delayed audio and video information, such that the local processor 502 can simply forward the undelayed audio (and optionally video) information to the mobile device(s) and forward the delayed audio/video information to the display device(s) with little or no further processing by the local processor 502. Similarly, the external processor 506 may return a combined audio/video signal in which the video information is delayed from the audio information, and the processor 502 may transmit the combined audio/video signal to both the mobile device(s) and the display device(s) without adding any further delay to either transmission path (this solution might be acceptable for situations in which audio will not be played by the display device, although it generally would be unacceptable for situations in which audio will be played by the display device because the signal sent to the display device would include unsynchronized audio and video information). Alternatively, the external processor 506 may return the audio and video information separately, in which case the processor 502 may transmit the audio information to the mobile device(s) and provide delayed video information to the display device(s), substantially as described above (this solution generally would require that the local processor 502 include substantial processing resources capable of adding the delay to the video information and generating the appropriate signals to the display device). The external A/V processor 506 may insert A/V elements into the output stream.

Synchronized Personal Sound System for Live Presentations

The Americans with Disabilities Act (ADA) requires that certain public places loan hearing assistance devices to people with hearing impairments. However, these devices are often not available as required. Even when available, these devices are often inconvenient to get and return and may be unsanitary. Also, the devices can bring unwelcome notice to people who are using them. Venues are burdened with the cost, administration and maintenance of the devices.

Furthermore, many people with “normal” hearing capabilities often have difficulty hearing presentations clearly in public places because of background noise, poor acoustics, or minor hearing impairments that may be volume and/or frequency dependent. People who are not fluent in English, in particular, need to hear English words clearly and distinctly to interpret them.

In exemplary embodiments of a synchronized personal sound system for live presentations, a hearing assistance system uses personal mobile devices such as smartphones or tablet computers as the hearing assistance devices, with audio provided to such devices over a wireless system such as WiFi. Specifically, audio from an audio source such as a live presenter (e.g., a professor in a lecture hall, a presenter at a symposium, etc.) is sent to both room/PA speakers (e.g., via wires or wirelessly) and the personal mobile devices, with appropriate delays to allow the audio played from the personal mobile device to be synchronized with the audio heard by the listener from the room/PA speakers, i.e., accounting for the acoustic delay from the room/PA speakers to the listener's ears. For purposes of this discussion, “acoustic delay” is the time it takes sound to travel directly from a room/PA speaker to a listener's ears (e.g., sound travels approximately one foot per millisecond and therefore the acoustic delay for a seat 40 feet from a speaker is approximately 40 ms, while reflected sound will take longer to arrive) and a “reference signal” is a signal consisting of one or more electronic, acoustic, or visual components set to some predetermined fixed delay equal to or greater than the expected maximum delay in secondary signals; the total actual delay in secondary signals is matched to the reference signal by adding additional delay to the secondary signals, if necessary. Personal mobile devices are almost ubiquitous, discrete, sanitary, socially acceptable, and convenient to use for all people—not just people with hearing impairments. Such a “bring-your-own-device” (BYOD) system will also reduce the cost, administration and maintenance for venues.

Unlike analog audio transmitted extremely fast to room/PA speakers, audio processed by computers and transmitted to mobile devices via a wireless network such as WiFi is subject to delay (or latency). In venues where live analog sound is broadcast directly to room/PA speakers and is concurrently digitally processed and broadcast via Wi-Fi to mobile devices, the unprocessed sound from the room/PA speakers reaches the listeners' ears before the processed sound from the mobile devices. Rather than enhance the sound from the room speakers, the mobile device sound competes with the room speaker sound. The two sounds are out-of-synchronization and when heard together may be unintelligible.

Therefore, in exemplary embodiments, audio from an audio source is processed by an audio processor, which sends the audio to both the room/PA speakers and to the personal mobile devices. A reference signal delay time is established equal to the maximum expected mobile device digital signal path delay determined using earlier generation mobile devices. Therefore the reference signal delay time is greater than the expected delay time of more current generation mobile devices. A fixed room acoustic delay is determined by using the room size dimensions relative to the speaker locations or by sampling the acoustic characteristics of each room from multiple seat locations. The amount of fixed delay added to the audio signal sent by the audio processor to the room/PA speakers is equal to the reference signal delay minus the fixed room acoustic delay. The audio processor may include the wireless transmitter or may provide the audio to an external wireless transmitter.

Generally speaking, the amount of wireless delay to the personal mobile devices is constantly changing due mostly to variations in the radio frequency path. Therefore, in certain exemplary embodiments, the audio processor and/or an application running in the personal mobile devices performs a periodic sound matching process that substantially synchronizes the sound played from each personal mobile device with the sound as heard from the room/PA speakers by the listener using such device.

In one exemplary embodiment, the audio processor performs a sound matching process in which the audio processor periodically “pings” each individual mobile device to calculate the variable delay (e.g., in milliseconds) and the “ping” times are then used to adjust the variable delay added to each mobile device to substantially match the reference signal delay time. This sound matching process is preferably automatic and dynamic, involving precise periodic synchronization of the sound processor with the mobile devices in millisecond units of measure. In certain exemplary embodiments, an application running in the mobile device allows the user to manually fine tune the mobile device synchronization to adjust for different seat locations within the room or to suit personal sound preferences. Such reference signal synchronization can be accomplished as discussed in U.S. Pat. No. 8,505,054 issued Aug. 6, 2013, which is hereby incorporated herein by reference in its entirety.

In another exemplary embodiment, the audio sent to the room/PA speakers may be delayed by an amount equal to or greater than the reference signal delay, and an application running in each personal mobile device allows for manually and/or automatically adding/adjusting delay within the personal mobile device in order to synchronize the audio played by the personal mobile device with the audio as heard by listeners through the room/PA speakers. Users may manually fine tune the mobile device automatic synchronization to adjust for different seat locations within the room or to suit personal sound preferences. Such reference signal synchronization can be accomplished as discussed in U.S. Pat. No. 8,505,054 issued Aug. 6, 2013, which is hereby incorporated herein by reference in its entirety.

Delaying the room/PA speaker sound creates two other considerations.

First is “lip-sync.” Because the sound broadcast through the speakers and smartphones is delayed, the presenter's lips will move before the delayed sound is heard. However, since sound delays are measured in milliseconds (1000th of a second), the visual difference will generally not be noticed in public venues where precise lip movements are not as visible as they are on TV, for example.

Another consideration is that presenters speaking “live” with microphones will hear their voice delayed through the room/PA speakers. This can be distracting and make it difficult to concentrate. This situation is common in large venues such as live performance theaters and concert halls where much audible sound delay is caused by the acoustic characteristics of large rooms. Large venues often use directional room speakers to project the speaker sound towards the audience and away from the performers. Also, monitor speakers with non-delayed sound are often placed facing the performers, directly in front of the performers. Alternatively, performers can hear the non-delayed sound through ambient noise blocking or noise cancelling headphones, earbuds, or other ear inserts that are barely visible to the audience. The synchronized personal sound system described herein preferably incorporates these techniques into the system. Specifically, in certain exemplary embodiments, the audio processor provides undelayed audio to a monitor output, which may be connected to a monitor used or worn by the presenter.

FIG. 13 is a schematic diagram showing an audio processor in accordance with one exemplary embodiment. The audio processor receives audio signals from a microphone, musical instrument, or other audio source, which can include audio from a transmitted medium (e.g., radio station, television station, cable or satellite television converter box, etc.) or from a recorded medium (e.g., a CD player or DVD player). The audio processor provides undelayed audio to a monitor output, provides delayed audio to the room/PA speakers via an external audio amplifier, and provides delayed audio to the personal mobile devices via a WiFi router, as discussed above. The audio sent to the personal mobile devices is synchronized with the audio as heard from the room/PA speakers. The presenter listens to the undelayed audio from the monitor output, e.g., through a monitor speaker or earphones.

The following are some Synchronized Personal Sound System Supplemental Applications. These applications are particularly good for such things as business conferences and higher education situations although they can be used more generally. Some of these applications will require or benefit from an assistant to manage the system while the presenter is on stage. Some audience features would be used during breaks.

Silent Sound System (no sound over room/PA speakers). This can provide hearing assistance in small venues with no PA system. Here, the system can be used without providing audio to room/PA speakers.

Multiple sound channels (e.g., multiple wireless channels to the personal mobile devices), such as for multiple languages, multiple rooms in close proximity, or multiple presentations in one large hall (not on PA speakers).

Recording (in certain embodiments, audience recording functions can be enabled/disabled by session leader), for example:

-   -   Record and save entire lecture or speech, e.g., with bookmarking     -   Instant playback with recording—e.g., 10 seconds per tap     -   Instant playback w/o recording—e.g., 10 seconds per tap     -   Instant recording w/o playback—e.g., 10 seconds per tap, record         only “sound bites”, similar to highlighting text;     -   Review (e.g., user can go back X seconds, e.g., to listen again         to something that was unclear, and then go back to “live” sound         either directly or fast-forwarding through intervening audio         (e.g., user can hear the intervening audio but played at double         speed).

Visuals (all visuals are displayed in sync with audio recordings at later playback), for example:

-   -   Display/download slides and videos on mobile device screens     -   Display/download session notes, key phrases or keywords (which         are defined)     -   Display scripted closed captioning     -   Display instant speech-to-text closed captioning     -   Display instant replay closed captioning—10 seconds per tap,         useful for ESL situations     -   Display teleprompter text     -   Display advertisements, particularly from conference sponsors     -   Display any visuals on main screen at front of house, as well as         on mobile device screens

Interactions (between presenter and audience, between audience members), for example:

-   -   Name badge info (or student name) can be automatically logged         in, plus entry and exit times     -   Audience can type questions that are selectively displayed by         presenter on all devices     -   Live audience questions: presenter activates for group, audience         members tap button, this cuts out PA speakers, audience member         speaks close to their smartphone, sound transmitted through all         connected headphones including presenter's     -   Audience members record questions privately that are put on a         queue and selectively played by session leader, these questions         could be recorded during breaks     -   Session bulletin board to post comments and questions for all to         view     -   Presenter bulletin board, only presenter sees questions and         comments     -   Meet & Greet board—audience members post their contact info,         sorted by interest area     -   Learning assessments: “pop” quiz questions given orally by         lecturer, or by text, quizzes can be individually automatically         scored and grades saved     -   Audience survey questions, collective audience results displayed         on all devices and main screen

Other features, for example:

-   -   Live or time-shifted broadcast of audio over the Internet, or         downloadable audio files     -   Simulate a room's reflected sound characteristics by sending         additional audio signal(s) to the mobile devices with additional         delays. This will recreate the auditorium effect which is often         lost when using assisted listening devices.     -   Add low level continuous masking background noise to block         ambient noise better, for audience and presenter.     -   Add an inverted sound wave of the source audio input to the         presenter's monitor earphones. This inverted sound wave is         synchronized to the acoustic sound emanating from the room         speakers as heard at the presenters' earphones. This “anti-sound         wave” will partially cancel the out-of-phase sound from the room         speakers that penetrates the monitor earphones.     -   Add a magnetic inductive loop which is connected to the sound         processor to the podium or other front-of-house area of the room         to be used for no-delay sound monitoring with compatible         telecoil earphones     -   Add password protected Wi-Fi sound channels to prevent         unauthorized listening outside of the room. Password protected         sound channels can also be used to allow the presenter,         conference managers and selected audience members in the room to         hear additional audio.     -   Add a private two-way sound channel to communicate with the         presenter during pauses.     -   Add audio public announcements and advertisements and to sound         channels during breaks that can be heard outside of rooms in         common areas.         Miscellaneous

Generally speaking, embodiments of the present invention ensure that the audio associated with a video or A/V signal is received at the mobile devices before the video is displayed on the display device, such that delay can be added to the audio manually and/or automatically to synchronize the audio to the video. In some exemplary embodiments described above, delay is expressly added to the video (e.g., by the content provider or by a separate processing unit). In certain alternative embodiments, however, the audio may arrive or be made to arrive at the mobile devices before the video is displayed without expressly adding delay to the video. For example, a content provider may transmit the audio signal over a high-speed medium (e.g., a wireless network such as a cell phone network, or the Internet) while the video or A/V signal traverses a slower-speed medium (e.g., a cable or terrestrial broadcasting system that inherently adds delay to the signal such as from satellite or other transmission devices through which the signal passes). Still, the application in the mobile device can be used to synchronize the audio to the video.

In some exemplary embodiment described above, a fixed amount of delay is added to the video and then a variable amount of delay is added to the audio in order to synchronize the audio to the video. In certain alternative embodiments, however, a fixed amount of delay may be added to the audio and a variable amount of delay may be added to the video to synchronize the audio and video, or variable amounts of delay may be added to both the video and the audio (e.g., the video delay may be variable and may be set to obtain near-synchronization with the expected delay of the audio signal reaching the mobile devices, with a variable amount of audio delay added by each mobile device to fine-tune synchronization and compensate for timing variations as discussed immediately below). In this regard, the mobile device may send synchronization information to the A/V processor or other device handling the video delay so that such device can delay the video by an amount that synchronizes (or substantially synchronizes) the video with the delayed audio played by the mobile device. For example, the mobile device may be configured to delay the audio by a fixed (or substantially fixed) amount and to convey delay information to the video delay device, or the mobile device may receive audio information with synchronization codes (discussed further below) and convey synchronization code information to the video delay device. The video delay device may receive synchronization information from multiple mobile devices and delay the video to accommodate the mobile devices, e.g., by delaying the video by an amount equal to the average of the mobile device delays or by an amount equal to the greatest of the mobile device delays.

In practice, the amount of delay needed to synchronize the audio to the video may vary over time, for example, based on such things as wireless signal strength, signal interference, varying delays in the transport network (e.g., cell phone network, Internet, etc.), distance from the transmitter, etc. The delay can also be different among users depending on such things as their relative distances from the transmitter and differences in their wireless devices. Thus, certain alternative embodiments of the present invention include provisions for monitoring and matching the relative delays of the audio and video streams automatically over time, with additional adjustments made manually as needed.

In one exemplary embodiment, the application in the mobile device sends “hello” or “ping type messages to the audio content server from time to time (e.g., multiple times per second) and upon receiving responses from the server, measures the round-trip delay and adjusts the amount of audio delay based on the round-trip delay. The round-trip delay is indicative of (but not necessarily equal to) twice the server-to-client network latency. Among other things, this allows the application in each mobile device to adjust the audio delay based on its individual signal reception characteristics.

In another exemplary embodiment, the audio content server may send “hello” or “ping” type messages to the mobile device and upon receiving responses from the mobile device, measures the round-trip delay. In embodiments in which the mobile device adds a variable amount of delay to the audio, the server can report the round-trip delay to the mobile device or provide other information based on the round-trip delay to allow the mobile device to adjust the audio delay. In embodiments in which a variable amount of delay is added to the video signal, the round-trip delay can be used to adjust the video delay to synchronize with the amount of delay added to the audio.

Additionally or alternatively, the separate audio and/or video streams may include time-stamp or other synchronization codes to allow for synchronization of the audio and video, for example, by the mobile device and/or at the display device (e.g., by a set-top box). For example, a separate processing unit that transmits the audio to the mobile devices and adds a fixed delay to the video might include in each audio packet an absolute or relative time at which the audio packet is to be played by the mobile device (e.g., if the current time is 12:00:00 and the video is delayed by 5 seconds, then the timestamp on the corresponding audio packet might be 12:00:05, and, assuming the mobile devices receive the audio packet before 12:00:05, each mobile device would hold the audio packet until 12:00:05 and then play it such that synchronization with the video would be substantially achieved provided that the mobile devices are substantially synchronized with the processing unit, e.g., based on a common reference clock such as those available from the Internet or from cellular phone networks). Generally speaking, the time-stamping would need to be sufficiently granular for specifying fractions of a second (e.g., milliseconds). Similar time-stamping could be added to the audio and/or video streams by the content provider.

FIG. 6 schematically shows a system that supports direct-from-provider audio/video synchronization, in accordance with various embodiments of the invention. Here, the source may provide a real-time and/or delayed audio (with or without video) signal to the mobile device 605 (represented by links 606 and 607 from program provider 601 and cable/satellite provider 602, respectively). For example, the signal sent to the mobile device 605 may include undelayed audio and optionally synchronization codes, allowing the mobile device 605 to synchronize the audio with video displayed on the display device 603, which may be delayed by the source or perhaps inherently delayed by the medium over which the video signal travels (e.g., satellite and/or terrestrial broadcasting network). Such synchronization may be done by the mobile device 605 automatically and/or with manual adjustment. Additionally or alternatively, the source may provide a real-time and/or delayed audio and/or video signal (represented by link 609 from the program provider 601 or cable/satellite provider 602) to an audio/sync processor 604, which in turn provides a real-time and/or delayed audio (with or without video, and with or without synchronization codes as appropriate for the application) signal to the mobile device 605 (represented by link 608) while providing a video (with or without audio, and with or without synchronization codes as appropriate for the application) signal for display on the display device 603 (represented by link 610, which may be sent to an intermediate device such as set-top box, which would process any synchronization codes included in the signal).

It should be noted that automatic synchronization is not limited to round-trip delay and time-stamping. Other delay measurement and automatic synchronization techniques may be used to set an initial time delay and/or make incremental adjustments to account for variations over time.

In practice, the mobile devices must be able to recover from various types of error conditions, most typically lost audio packets, but also other error conditions such as audio packets that arrive too late or that arrive out-of-order.

In order to deal with the issue of lost packets, each audio sample may be sent in two independent packets for redundancy. For example, each audio packet may be transmitted in duplicate. As long as one of the packets arrives at the mobile device in time to be played in synchronization with the corresponding video, the mobile device can play the audio information. If the mobile device receives an audio packet too late for the audio packet to be played, the mobile device typically just ignores or discards the audio packet.

Additionally or alternatively, sequence numbers may be included in the audio packets so that the mobile devices can detect if an audio packet has been missed or if audio packets have arrived out-of-order (in which case the packets can be re-ordered into the correct sequence). If duplicate audio packets are transmitted as described immediately above, the sequence number would generally be the same for both duplicate packets.

In the event that one or more audio packets has been missed or packets are not received in time to synchronize with the video (i.e., there is a gap in the audio information received by the mobile device), the mobile device typically injects one or more “dummy” packets to fill the gap left by the missing packet(s). An injected packet may contain no audio data or may contain “dummy” data such as a small level of white noise to match the perceived volume of the surrounding audio (i.e., the audio information received before and perhaps after the missing packet) or audio data interpolated or extrapolated from the surrounding audio data (i.e., the audio information received before and perhaps after the missing packet).

In an exemplary embodiment, the application in the mobile device may be configured to manage the audio packets using a circular buffer. Received packets are placed appropriately into the circular buffer, and “dummy” packets are injected into the circular buffer as needed. Generally speaking, the circular buffer needs to accommodate audio data for at least the amount of time the video is delayed. In situations where the amount of audio delay is variable, the application in the mobile device may make automatic adjustments over time as discussed above. Typically, the application in each mobile device maintains a target delay that represents the combination of such factors as the amount of delay requested by the user, network variations that induce loss or latency changes, and slight variations in the analog-to-digital and digital-to-analog conversion clocks on the server and client respectively. As discussed above, the delay may be adjusted automatically from time to time to compensate for variable network delay. Actual delay may only be measured periodically and with a large (but uniform) error in a user-mode application, so an exponential moving average may be used to predict the true delay with minimal hysteresis. The client then discards packets or injects dummy packets appropriately to match the predicted true delay to the target delay, within a given tolerance.

In certain exemplary embodiments, communication between the processing unit (server) and the mobile devices may utilize universal datagram packets (UDP) over an 802.11 wireless network. A combination of broadcast and host-to-host packets are used to organize the network, transmit audio data, and measure network latency. The server typically broadcasts its IP address to all subnet IP addresses periodically, and if a client mobile device is on the subnet, it will receive the server broadcast and can initiate communication with the server. As discussed above, each client periodically sends beacon packets to the server to indicate that the client still is participating in the network. In certain embodiments, the server aggregates client information and includes this information in the beacon, so clients may initiate their own peer-to-peer connection on the subnet without the assistance of the server. The server typically sends audio data as PCM samples which are compressed using the DEFLATE algorithm (gzip implementation). The compression allows scaling the audio stream to different bit-levels of precision without changing the protocol, as the compression will eliminate bits that do not carry information.

In some exemplary embodiments described above, only the audio signal is sent to the mobile devices. In various alternative embodiments, however, both the audio and video signals may be sent to some or all of the mobile devices (the user may be able to choose whether to receive only audio or both audio and video, perhaps for additional cost to receive the video), and the mobile device is configured to synchronize both the audio and video to the video displayed on the separate display device. Such functionality may be useful, for example, to allow the user to continue viewing the video even when away from the separate device or when viewing of the separate display device is obstructed.

It should be noted that the application in the mobile device provides not only the synchronization functionality and related functionality (e.g., provision features such as registration for the service, logging into the service, accepting payments relating to the service, user authentication for the service generally and/or for specific features such as restrictions on particular television channels, and capturing usage information) but also represents a platform from which additional functionality can be provided. For example, the application may include an interface for playing games that are specifically tied to the TV programming such as sports games or quiz show games. Individual responses to these games may be scored via the Internet. These games could be played individually or in social groups. Furthermore, the application may include additional social network features including connecting with others in the local site (e.g., the sports bar) or connecting with others who are not local (at home, for example). Here the non-local people could see the same mobile screen info (e.g., fantasy sports games mentioned above) and be included in group scoring. Games could include players from multiple locations all aggregated over the Internet (e.g., people in different bars all playing the same trivia game). The application for the mobile devices may be a separate application designed specifically for the audio/video synchronization service or may be a third party application (e.g., an application for a smart phone designed for receiving television programming or video-based content) to which the audio/video synchronization functionality is added.

In some exemplary embodiments described above, a processing unit may process A/V signals for multiple programs, even from multiple sources. For example, the processing unit may process multiple television channels (e.g., received from cable, satellite, or over-the-air broadcasts) as well as content obtained from the Internet and even locally generated content (e.g., from a local server in communication with the processing unit or from the processing unit itself), with different video streams sent to different televisions or monitors and the corresponding audio distributed as discussed herein. In this way, for example, the processing unit may process any of a variety of “custom” A/V signals designed specifically for the service such as, for example, a channel that contains advertisements, entertainment (trivia quizzes, quotes, etc.), news, weather, sports scores, etc.

It should be noted that, while some exemplary embodiments are discussed above with regard to specific applications and locations (e.g., sports bars, malls, etc.), the present invention is not limited to any particular applications and locations. Thus, for example, embodiments of the present invention may be used in applications including without limitation restaurants and bars, malls and stores, nightclubs, museums, stadiums, billboards, schools (e.g., lecture halls), theaters (including drive-in theaters), concerts, amusements, and guided and unguided tours, and other indoor and outdoor venues for, among other things, providing convenient wireless audio to accompany and enhance such things as TV viewing, location-based (place-based) advertising, digital out-of-home applications, electronic signage, and digital signage, using wireless audio (and video) receiving devices that millions of people carry with them every day.

The technology is particularly useful for situations in which it is impractical to play audio through speakers, such as environments that are too noisy to hear the audio (e.g., pubs, stadiums), environments in which there are different video programs that would require multiple concurrently-played audio programs (e.g., sports bars, shopping centers), environments in which different audio programs are provided for a video (e.g., multi-language versions of the audio), environments in which the viewers are not congregated at a single location or otherwise are unable to hear the audio (e.g., due to distance or because of physical barriers such as viewing through a window), environments that require quiet or silence (e.g., libraries, hospitals, houses of worship, homes), and environments in which no audio equipment is available for playing the audio (e.g., audio for a portable video system can be provided wirelessly without having to lug around an audio amplifier and speakers). Generally speaking, embodiments of the present invention may be employed in any of a wide variety of indoor and outdoor venues, in and/or on any of a wide variety of structures and vehicles, with stationary or mobile video displays, and for any of a wide variety of applications including public and private showings, advertising applications, and “digital signage” applications. Without limitation, the described technology may be employed in the following types of locations and situations:

Arts, Entertainment and Recreation, e.g., theaters, drive-in movies, zoos, aquariums, tourist sites, gyms, spas, concerts, shows, exhibitions, amusement parks, multimedia presentations, laser light shows, planetaria, IMAX™ theaters.

Banking and Finance, e.g., retail locations of banks, credit unions, brokerage companies, ATM locations.

Bars, Pubs, Nightclubs, Casinos, e.g., sports bars where one or more TVs are often used to show one or more sporting events, dance clubs (e.g., “silent” dances where no music is played through speakers and all audio is carried wirelessly to personal audio players).

Corporate Communications, e.g., office building lobbies, elevators, meeting rooms, training rooms, auditoriums, warehouses, cafeterias, advertisements, motivational/inspirational messages.

Digital Out-Of-Home Networks, e.g., doctor's offices, convenience stores, gas stations, public buildings, WiFi hotspots.

Educational Facilities, e.g., schools, colleges, universities, training sites, libraries, museums, lecture halls, testing rooms, driver's education courses, bar exam review courses, group learning environments, personal learning environments.

Government Facilities, e.g., local, county, state, federal, administrative (e.g., registry of motor vehicle locations for such things as learner's permit testing, remedial driving school, etc.), public service announcements and alerts.

Healthcare, e.g., hospital waiting rooms, group areas, nursing homes, assisted living facilities, rehabilitation centers.

Homes, e.g., wireless audio for TV or music, multi-room synchronized audio and video.

Hotels, Motels, Resorts, e.g., lobbies, poolside video, ski lodges.

Houses of Worship, e.g., in churches, temples, mosques and other religious building for such things as broadcasting ceremonies (e.g., religious services, sermons, religious rites, weddings, Baptisms, Bar and Bat Mitzvahs), to enable silent prayer, to support multi-language events (e.g., a presentation given in English and Hebrew), or to distribute information such as event and holiday schedules, inspirational messages, promotions.

Public Spaces, e.g., public access buildings, city square electronic display boards such as in Times Square, public events, roadside signage, traffic signage.

Restaurants, e.g., fast food restaurants, diners, full service restaurants, coffee houses.

Retail, e.g., retail stores, strip centers, malls.

Stadiums and Arenas, e.g., sports stadium electronic display boards such as video replays on stadium “jumbo boards,” concert halls, auditoriums, trade shows.

Transportation, e.g., in transportation buildings (e.g., airports, train and bus stations, ship ports), onboard vessels (e.g., airplanes, cruise ships, buses, trains, ferries, taxis, limos).

Electronic signage, e.g., stationary signage (e.g., billboards, placards, stadium displays, digital menu boards), portable signage, mobile signage (e.g., signs placed on moving vehicles such as buses, trains, cars, trucks, blimps that are frequency at large sporting events).

Multiple Location Viewing, e.g., multi-location synchronized viewing (e.g., movies, TV shows, sporting events), multi-location gaming (e.g., online gaming), multi-location testing (e.g., such as for SATs, PSATs, LSATs, online universities), simulcasting.

Projected Video, e.g., a video projector or other projection device (e.g., laser light show) used to project video onto a surface (e.g., an indoor wall or ceiling, an outdoor surface such as the side of a building, etc.) with accompanying audio provided wirelessly.

“Silent” Events, e.g., silent dances in which music is distributed wirelessly and synchronized as discussed herein, optionally in conjunction with videos (such as music videos) being shown on one or more monitors. Among other things, different videos may be shown on different monitors and people may have the ability to independently choose which audio stream to receive for listening/dancing such that different people may be dancing to different music at any given time and people can switch between the different audio programs as they move around and dance with other people (e.g., tune-in to a slow song to slow-dance with one person, then tune-in to a fast song to dance with a group of people). Among other things, such functionality allows for different types of music and different types of dances to be ongoing at the same time, e.g., a ballroom dance, a square dance, a line dance, a hip hop dance, a salsa dance, and a disco dance all happening at the same time, where people can switch from one dance to another at any time.

Silent dances have been gaining in popularity as they allow for parties to be held without violating noise ordinances in such places as apartments, condominiums, dormitories, and even outdoors, although silent dances are generally held using wireless headsets that are tuned to receive a broadcast audio signal without any particular synchronization or simply by having all participants download a common playlist and attempt to play the playlist at the same time. Embodiments of the present invention avoid the pitfalls of such approaches (e.g., distributing headsets is expensive and there are hygienic issues involved with sharing headsets among strangers, while playlists are static and require that all participants manually attempt to synchronize the audio, for example, by pressing “play” on their respective devices at the same time, which even then does not ensure synchronization due to inherent differences between devices) while providing for enhanced functionality such as multiple audio/video programs.

Of course, such “silent” events do not necessarily need to be silent, as there can be background audio played aloud and users can selectively choose (i.e., by wearing or removing their earpieces) whether to listen to the background audio or participate in one or more audio programs. Among other things, this allows for different types of functions to be held at the same location. For example, a “silent” event may include a sports bar in which some people are watching a game with the audio played aloud while at the same time a “silent” dance is being held next to the bar. For another example, a “silent” event may include a presentation in which a lecturer is speaking to a group of people but alternative video presentations are displayed on monitors with the corresponding audio provided and synchronized wirelessly, allowing users to selectively listen to the lecture and/or watch a video.

It should be noted that embodiments can provide for different audio streams to be provided to different mobile devices, e.g., some users may want English language, some users may want Spanish language, etc. Additionally or alternatively, the audio provided to the mobile device may come from a source different than the source of the video (with or without audio) that is displayed on the display device. For example, as depicted schematically in FIG. 7, an A/V source (e.g., a cable, satellite, or program provider 702) may provide an A/V signal 707 (with or without delay, without or without synchronization codes) to be displayed on display device 703 (optionally via an intermediate device such as A/V processor 704, which may send at least audio 708 to mobile device 705 and at least video to display device 703, adding delay to one or both signals as needed for a particular application), while the mobile device 705 receives a separate audio (with or without video) signal 706 via a separate path from the same provider or from a different provider from a separate provider such as a web-based provider and synchronizes the audio from signal 706 to the video displayed on display device 703 using any of the synchronization techniques described herein. The audio in signal 706 may be different than the audio in signal 707; in one specific example (without limitation), the signal 707 may be a national broadcast of a sporting event (e.g., a NFL football game on national television) while the audio in signal 706 may be the audio from a local television or radio station broadcast (e.g., directly from a local station web site or through a service such as NFL Audio Pass, which provides local radio announcements of NFL games). Alternatively, the audio in signal 706 may be the same as the audio in signal 707, for example, to avoid re-transmission of the audio and/or video portion of a signal inherent in some of the delay schemes described herein (e.g., the mobile device user may subscribe to or otherwise have permission to receive signal 706 while the location of the display device 703 may have permission to receive signal 707, such

It should be noted that the synchronization techniques described herein can be used to synchronize audio/video playback on local and mobile devices. For example, upon receipt of an A/V signal such as a television program by a processing unit, the processing unit may delay the entire A/V signal provided to an attached television while transmitting the A/V signal to the mobile devices over the wireless network such that each mobile device synchronizes the received A/V signal to the A/V signal on the television as described above.

While some exemplary embodiments are described above with reference to synchronizing audio and/or video on a mobile device to a reference signal, it should be noted that some alternative embodiments may provide for the mobile device to receive audio and/or video information without synchronizing to a reference signal. Thus, for example, the application running in the mobile devices may include one, the other, or both of a mode of operation in which audio and/or video information is synchronized to a reference signal and a mode of operation in which audio and/or video information is received and played without synchronization to a reference signal. As but one example, an exemplary embodiment may permit a museum or tour operator to provide the mobile device with unsynchronized general audio and/or video information in one exhibit (e.g., when the user is looking at a painting or sculpture, the user may receive an audio/video presentation about the artwork) and to provide audio information to be synchronized with a video display in another exhibit (e.g., the exhibit may include a video with the audio provided wirelessly to the mobile device so that others people nearby are not disturbed by the audio). The application in the mobile device may be configured to automatically determine when particular content requires synchronization and when other content does not require synchronization and may switch its operation between the two modes accordingly. In doing so, the application in the mobile device may, from time to time, need to synchronize with different reference signals (e.g., a processing unit servicing one exhibit may use different delay parameters than a processing unit (or the same processing unit) servicing a different exhibit, in which case the application in the mobile device may synchronize with the appropriate reference signal.

Thus, embodiments of the present invention may transmit one or more streams of synchronized audio and/or video signals via WiFi or other wireless protocols, where the signals may be received as part of a television or other A/V signal and/or from other sources such as radio, MP3 players, DJ systems, etc.

As discussed above, the user may (and generally does) have the ability to independently choose which audio stream to receive when there are multiple audio streams available (e.g., when there are multiple televisions). In certain exemplary embodiments, the application running on the user's mobile device allows the user to select an audio stream from among the available audio streams. This can be accomplished in any of a variety of ways.

In one exemplary embodiment, the application running in the user's mobile device obtains information from the server regarding the available channels (e.g., six channels available, numbered 1-6) and displays the available channel numbers on a user interface screen from which the user can see the currently selected channel (e.g., by having the channel number of the currently selected screen highlighted) and/or can select a channel for listening.

FIG. 8 shows an exemplary user interface screen in accordance with such an embodiment. Here, the application displays on the user interface screen numeric “buttons” 802 corresponding to the number of available channels (in this example, six channel number “buttons” 802 numbered 1-6 are shown to indicate that six channels are available). The user can select one of the six available channels. One particular embodiment supports up to ten channels and so the application may display from 1 to 10 channel numbers depending on the actual number of channels that are available, as indicated by the server. As mentioned above, the currently selected channel may be indicated on the user interface screen, e.g., by highlighting the currently selected channel number button or displaying the currently selected channel number. Also included in this user interface screen are the manual delay adjustment controls 804 and volume control 806 discussed above. Also included in this user interface screen is a “show on TVs” button 801 that, when pressed by the user, causes the application to send a signal to the server, which in turn causes the corresponding channel number to be temporarily displayed on each participating television screen (e.g., superimposed on the television program) so that the user can see which audio stream is associated with which television. In this exemplary embodiment, the same display functionality is also performed when the user presses one of the channel number buttons 802. It should be noted that, in various alternative embodiments, rather than displaying the channel number on the television screen, a channel number could be affixed to or otherwise displayed near the television to inform users of the channel number associated with the television.

In an alternative embodiment, system may support a larger number of channels (e.g., more than ten channels), in which case the application may display on the user interface screen digits that the user can select to enter the channel number (e.g., the user interface screen may include the digits from 0-9 and the user can enter channel number, such as a two or three digital channel number). FIG. 9 shows an exemplary user interface screen having the separate digits 902 from which the user can enter the desired channel number. FIG. 10 shows a variation of FIG. 9 in which the selected channel number 1002 is displayed.

In yet another alternative embodiment, the channel may be selected using the built-in camera of the user's mobile device. For example, in one exemplary embodiment, using the application in the user's mobile device, the user points the camera towards a desired television or item related to the desired television. The application in the user's mobile device captures an image (e.g., of the television screen, a portion of the television screen, or other item) and the desired television (and hence the desired audio stream channel) is identified based on the image. Such identification can be performed by the application in the user's mobile device (in which case the application can automatically switch to the desired channel) or by the server (e.g., the application conveys the image to the server, which identifies the desired television/channel and conveys the desired channel back to the application, which automatically switches to the desired channel). The television/channel may be identified using any of a variety of identification schemes, such as, for example, a code displayed on the television screen or near the television screen (e.g., bar code, QR code, or similar code pattern), character recognition of a channel number on or near the television screen, interpretation of a sequence of symbols appearing on a portion the television screen similar to a visual Morse code, a comparison of the captured image to screen images being displayed on the various televisions, or by sampling very specific multiple portions of the screen and using patterns created by the content of the multiple sample areas to match a desired screen image to screen images being displayed on the various televisions, to name but a few. A zoom control may be provided so that the user can “frame” the television screen within a predetermined window on the user interface screen or zoom in to a portion of the television screen (a portion containing a channel number or code) or to a related item (e.g., a number posted on or near the television).

FIG. 11 shows an exemplary user interface screen having an image window 1102, a zoom control 1104, and a “select” button 1106 for capturing an image in order to identify the desired television/channel. In this exemplary embodiment, the user would press the button on the zoom control 1104 to turn on the camera, view the image to be captured in the window 1102 while optionally sliding the zoom control 1104 to zoom in or out as needed, and then press the zoom control button again or alternatively press a select button 1106 to capture the image. It should be noted that all of the manual image capture functions could be executed by oral commands or automated so that the user just has to point the camera towards a television screen or other item to initiate the image capture sequence. FIG. 12 shows a representation of a portion of a television screen containing a pattern that can be used to identify the desired television/channel.

Additionally or alternatively to using the camera, in some embodiments, the application can be configured to detect orientation of the phone such that the user can just point the mobile device in the direction of a television to switch the sound channel sound without framing, zooming in on a code, or tapping on a screen button. For example, the phone could detect the direction of a highly directional and optionally encoded WiFi signal originating at a television.

In another alternative embodiment, the sound channel number on a user's mobile device may be selected utilizing the voice recognition capabilities of the mobile device. In this embodiment, the user speaks the desired sound channel number into the mobile device microphone. The application utilizes the mobile device's voice recognition capability, which conveys the desired sound channel number back to the application. The application then switches to the channel number spoken by the user, e.g., by transmitting the selected sound channel to the user's mobile device over a communication connection.

In addition to switching mobile device sound channels, voice recognition may also be used for inputting commands for other input functions as discussed above.

In yet another embodiment, the sound channel number on a user's mobile device may be selected utilizing short range (i.e., approximately 30 feet or less) radio frequencies, such as Bluetooth or Wi-Fi, to detect the proximity of a mobile device. In this embodiment the computer hardware and application software associated with a particular screen (the master device) will be in stand-by mode ‘listening’ for mobile devices (slave devices) containing the application that are in “seek” mode. If slave devices in seek mode come within range of a master device, communication will automatically commence through the exchange of data packets. When the master device discovers new slave devices in the network range, inquiry packets are sent to those devices. Any slave devices in seek mode reply with packets containing device information. The master device authenticates the slave devices then sets the sound channel number in the slave devices to the sound channel number associated with the proximate screen.

Regardless of how the sound channel is selected, the audio/video processor server may associate the selected sound channel with the user's mobile device to allow for interaction between the mobile device and the server for a particular video screen or screens (in addition to the server providing the audio for the selected sound channel to the user's mobile device over a wireless communication connection), such as for interaction with and through content presented by the server on the video screen (e.g., data entry fields, selectable links, etc.).

Thus, for example, a user may use the mobile device to enter an “interactive” mode in which the server correlates user inputs from the mobile device with a particular video screen (or screens) based on the selected sound channel. For example, the server may place a pointer, cursor, or other icon (e.g., a video game piece) on the video screen(s), and the user may be able to navigate via designated keys or the navigation device (e.g., trackball, trackpad, actions made through a touchscreen, etc.) on the mobile device, with corresponding navigation commands sent by the mobile device to the server. Through such navigation, the server may allow the user to perform such actions as enter data in designated data fields on the screen (e.g., to complete a survey, order food or drink from a menu, provide comments, etc.), select active links on the screen, play a video game alone or with others at the establishment or even elsewhere, participate in an interactive chat with others at the establishment or even elsewhere, highlight elements on the screen to obtain information about such elements (e.g., select a word and open a search engine to search the word), and perform other interactive functions as a particular implementation might provide.

Furthermore, the user may be permitted by the server to switch between multiple interactive screens through selection of the sound channel. For example, one screen may provide an interactive chatroom, one screen may provide a questionnaire; one screen may provide an interactive menu, etc., and the user may switch from screen to screen by selecting the appropriate sound channel.

Additionally or alternatively, the mobile device may provide interactive screens through which the user can interact with the content of a particular video screen or more generally with the server. The interactive screens may be generated by the server and transmitted to the mobile device, or the interactive screens may be generated by the mobile device, e.g., based on a context provided by the server. Among other things, this provides a level of privacy for the user interactions, and also facilitates interactions with a particular video screen by multiple users simultaneously. The video display device screen and the mobile device screen may be used in tandem to allow for interactions between the user and the system. For one example, the video display device screen may be used to display a menu of selectable items (e.g., a menu of food/beverages) and the screen presented by the mobile device may allow the user to order selections from the menu. For another example, the video display device screen may be used to display a multiple choice survey question (e.g., “What are the chances of the Patriots winning the Super Bowl next year?”) with various answer choices and the screen presented by the mobile device may allow the user to provide an answer to the question. A wide variety of other tandem uses are possible.

As discussed above, a user may select a sound channel merely for listening, independently of watching corresponding video on a video screen. In certain implementations, audio can be provided without any video screens. In other implementations, multi-channel audio is provided with a single video screen, where the server presents on the video screen video corresponding to the selected sound channel. Thus, for example, the audio/video server may support multiple modes of operation, such as, for example, multiple sound channels with multiple video feeds (e.g., displaying different video feeds on different video screens and allowing each user to select the sound channel associated with a given video screen), multiple sound channels with a single video feed (e.g., allowing a user to select a sound channel and displaying the corresponding video on the screen), and multiple channel audio with no video feeds.

In any of the embodiments described herein, an application running on the mobile device may be configured to detect that the mobile device is within range of the wireless sound channels being transmitted by the system and may automatically select a sound channel (e.g., a default sound channel that provides a menu of other selectable sound channels and/or services) or generate an alert to the user (e.g., a ring tone, a vibration, an icon on the mobile device display screen, a menu of available sound channels, etc.) upon which the user can select a sound channel. A menu of available sound channels may include, for example, sound channel numbers, video screen numbers, TV channel numbers, sound channel topics, and/or other information allowing the user to identify the sound channel and/or video associated with the sound channel. For example, a menu might just list the categories of videos currently being displayed (e.g., Bruins Hockey, Red Sox Pre-Season Baseball, Celtics Basketball, College Basketball Tournament, etc.) and the server identifies the selected sound channel based on the user's selection from the menu.

Furthermore, the server can correlate user activities with a corresponding user or user account. For example, a user may order food or drinks from an interactive menu as discussed above, and the server may automatically add the cost to the user's tab, e.g., identified based on the user's mobile device. In some cases, the server may be able to locate the user based on the location of the user's mobile device (e.g., based on GPS, proximity detection, etc.) and therefore may direct where the user's order is delivered, e.g., to a particular booth, table, or bar seat; to a patio; poolside to a particular table or lounge chair; in a casino to a particular game table or slot machine; at an entertainment venue (e.g., to a particular pool table or bowling lane), etc.

It should be noted that terms such as “computer,” “set top box,” “phone,” “smartphone,” “router,” and “modem” may be used herein to describe devices that may be used in certain embodiments of the present invention and should not be construed to limit the present invention to any particular device type unless the context otherwise requires. Thus, a device may include, without limitation, a bridge, router, bridge-router (brouter), switch, node, server, computer, appliance, or other type of device. Such devices typically include one or more network interfaces for communicating over a communication network and a processor (e.g., a microprocessor with memory and other peripherals and/or application-specific hardware) configured accordingly to perform device functions. Communication networks generally may include public and/or private networks; may include local-area, wide-area, metropolitan-area, storage, and/or other types of networks; and may employ communication technologies including, but in no way limited to, analog technologies, digital technologies, optical technologies, wireless technologies (e.g., Bluetooth), networking technologies, and internetworking technologies.

It should also be noted that devices may use communication protocols and messages (e.g., messages created, transmitted, received, stored, and/or processed by the device), and such messages may be conveyed by a communication network or medium. Unless the context otherwise requires, the present invention should not be construed as being limited to any particular communication message type, communication message format, or communication protocol. Thus, a communication message generally may include, without limitation, a frame, packet, datagram, user datagram, cell, or other type of communication message. Unless the context requires otherwise, references to specific communication protocols are exemplary, and it should be understood that alternative embodiments may, as appropriate, employ variations of such communication protocols (e.g., modifications or extensions of the protocol that may be made from time-to-time) or other protocols either known or developed in the future.

It should also be noted that logic flows may be described herein to demonstrate various aspects of the invention, and should not be construed to limit the present invention to any particular logic flow or logic implementation. The described logic may be partitioned into different logic blocks (e.g., programs, modules, functions, or subroutines) without changing the overall results or otherwise departing from the true scope of the invention. Often times, logic elements may be added, modified, omitted, performed in a different order, or implemented using different logic constructs (e.g., logic gates, looping primitives, conditional logic, and other logic constructs) without changing the overall results or otherwise departing from the true scope of the invention.

The present invention may be embodied in many different forms, including, but in no way limited to, computer program logic for use with a processor (e.g., a microprocessor, microcontroller, digital signal processor, or general purpose computer), programmable logic for use with a programmable logic device (e.g., a Field Programmable Gate Array (FPGA) or other PLD), discrete components, integrated circuitry (e.g., an Application Specific Integrated Circuit (ASIC)), or any other means including any combination thereof. In a typical embodiment of the present invention, predominantly all of the described logic is implemented as a set of computer program instructions that is converted into a computer executable form, stored as such in a computer readable medium, and executed by a microprocessor under the control of an operating system.

Computer program logic implementing all or part of the functionality previously described herein may be embodied in various forms, including, but in no way limited to, a source code form, a computer executable form, and various intermediate forms (e.g., forms generated by an assembler, compiler, linker, or locator). Source code may include a series of computer program instructions implemented in any of various programming languages (e.g., an object code, an assembly language, or a high-level language such as Fortran, C, C++, JAVA, or HTML) for use with various operating systems or operating environments. The source code may define and use various data structures and communication messages. The source code may be in a computer executable form (e.g., via an interpreter), or the source code may be converted (e.g., via a translator, assembler, or compiler) into a computer executable form.

Computer program logic implementing all or part of the functionality previously described herein may be executed at different times on a single processor (e.g., concurrently) or may be executed at the same or different times on multiple processors and may run under a single operating system process/thread or under different operating system processes/threads. Thus, the term “computer process” refers generally to the execution of a set of computer program instructions regardless of whether different computer processes are executed on the same or different processors and regardless of whether different computer processes run under the same operating system process/thread or different operating system processes/threads.

The computer program may be fixed in any form (e.g., source code form, computer executable form, or an intermediate form) either permanently or transitorily in a tangible storage medium, such as a semiconductor memory device (e.g., a RAM, ROM, PROM, EEPROM, or Flash-Programmable RAM), a magnetic memory device (e.g., a diskette or fixed disk), an optical memory device (e.g., a CD-ROM), a PC card (e.g., PCMCIA card), or other memory device. The computer program may be fixed in any form in a signal that is transmittable to a computer using any of various communication technologies, including, but in no way limited to, analog technologies, digital technologies, optical technologies, wireless technologies (e.g., Bluetooth), networking technologies, and internetworking technologies. The computer program may be distributed in any form as a removable storage medium with accompanying printed or electronic documentation (e.g., shrink wrapped software), preloaded with a computer system (e.g., on system ROM or fixed disk), or distributed from a server or electronic bulletin board over the communication system (e.g., the Internet or World Wide Web).

Hardware logic (including programmable logic for use with a programmable logic device) implementing all or part of the functionality previously described herein may be designed using traditional manual methods, or may be designed, captured, simulated, or documented electronically using various tools, such as Computer Aided Design (CAD), a hardware description language (e.g., VHDL or AHDL), or a PLD programming language (e.g., PALASM, ABEL, or CUPL).

Programmable logic may be fixed either permanently or transitorily in a tangible storage medium, such as a semiconductor memory device (e.g., a RAM, ROM, PROM, EEPROM, or Flash-Programmable RAM), a magnetic memory device (e.g., a diskette or fixed disk), an optical memory device (e.g., a CD-ROM), or other memory device. The programmable logic may be fixed in a signal that is transmittable to a computer using any of various communication technologies, including, but in no way limited to, analog technologies, digital technologies, optical technologies, wireless technologies (e.g., Bluetooth), networking technologies, and internetworking technologies. The programmable logic may be distributed as a removable storage medium with accompanying printed or electronic documentation (e.g., shrink wrapped software), preloaded with a computer system (e.g., on system ROM or fixed disk), or distributed from a server or electronic bulletin board over the communication system (e.g., the Internet or World Wide Web). Of course, some embodiments of the invention may be implemented as a combination of both software (e.g., a computer program product) and hardware. Still other embodiments of the invention are implemented as entirely hardware, or entirely software.

Without limitation, some additional embodiments of the present invention may be characterized by the potential claims listed in the paragraphs following this paragraph (and before the actual claims provided at the end of this application). These potential claims form a part of the written description of this application. Accordingly, the subject matter of the following potential claims may be presented as actual claims in later proceedings involving this application or any application claiming priority based on this application Inclusion of such potential claims should not be construed to mean that the actual claims do not cover the subject matter of the potential claims. Thus, a decision to not present these potential claims in later proceedings should not be construed as a donation of the subject matter to the public. Without limitation, potential subject matter that may be claimed (prefaced with the letter “P” so as to avoid confusion with the actual claims presented below) includes:

P1. A method of synchronizing an audio and video signal comprising: receiving an audio/video signal; forwarding the audio signal to a first device without delay; and forwarding the video signal to a second device with a delay.

P2. (Entire system): A system for synchronizing audio distributed to a mobile device with a video displayed on a display device, the system comprising:

a. a processing unit including

i. a receiver for receiving an A/V signal,

ii. a decoder configured to separate the A/V signal into a video and an audio component,

iii. a video encoder configured to add a delay to the video portion of the A/V signal, and

iv. at least one transmitter configured to transmit at least one of the encoded video and the audio components of the A/V signal;

b. at least one display device connected to the transmitter of the processing unit, the display device receiving the encoded video component of the A/V signal from the processing unit; and

c. a router connected to the transmitter, the router configured to receive the audio signal from the transmitter and wirelessly transmit the audio signal to a mobile device without the delay.

P3. (Set Top Box): A system for distributing audio to a mobile device in synchronization with video displayed on a display device, the system comprising:

a. a processing unit including

i. a receiver configured to receive an A/V signal transmitted from a service provider,

ii. a demodulator configured to separate the A/V signal into a video and audio component,

iii. an encoder configured to add a delay to the video component of the A/V signal, and

iv. a transmitter configured to transmit the encoded video component of the A/V signal to a display device and transmit the audio portion of the A/V signal to a mobile device without the delay.

P4. (Program/Sequence running in box top unit): A system for synchronizing audio received on a mobile device with a video displayed on a display device, the system comprising:

a signal processor, the signal processor configured to receive a video signal and an audio signal from a service provider, transmit the video signal to a display device with a delay, and transmit the audio signal to a mobile device without the delay, wherein the delay is selected such that the video signal will be displayed on the display device after receipt of the audio signal by the mobile device.

P5. (App on Processing Unit) A computer program product for synchronizing audio on a mobile wireless device with a delayed video displayed on a display device, the computer program product comprising a computer usable medium having computer readable program code thereon, the computer readable program code comprising:

a. program code for determining an arrival time of an audio signal at the mobile device, the audio signal wirelessly transmitted to the mobile device without a delay

b. program code for delaying transmission of a video signal to a display device based on the arrival time of an audio signal at the mobile device such that the video signal is transmitted to the display device in synchronization with the audio on the mobile device in synchronization with the delayed video being displayed on the display device.

P6. (automatic mobile audio sync.) A computer program product for synchronizing audio on a mobile wireless device with a delayed video displayed on a display device, the computer program product comprising a computer usable medium having computer readable program code thereon, the computer readable program code comprising:

a. program code for determining a display time for the display of the delayed video on the display device

b. program code for incrementally adjusting the amount of delay by which the audio signal is broadcast on the mobile device such that the audio signal broadcasts on the mobile device in synchronization with the delayed video being displayed on the display device.

P7. (manual mobile audio sync.) A computer program product for synchronizing audio on a mobile wireless device with a delayed video displayed on a display device, the computer program product comprising a computer usable medium having computer readable program code thereon, the computer readable program code comprising:

b. program code for incrementally adjusting the amount of delay by which the audio signal is broadcast on the mobile device such that the audio signal broadcast on the mobile device in synchronization with the delayed video being displayed on the display device.

P8. A method for synchronizing audio and video, the method comprising:

transmitting audio information associated with an A/V signal to at least one mobile device over a communication network that delays delivery of the audio information to the at least one mobile device;

transmitting to at least one display device video information associated with the A/V signal delayed by a predetermined delay greater than the delay introduced by the communication network.

P9. A method according to claim 6, further comprising:

adding further delay to the audio information by each mobile device to substantially synchronize the audio with the video displayed on the display device.

P10. Systems, apparatus, and methods according to claims P1-9, further including one or more of the following:

-   -   transmitting advertisements to the display device;     -   transmitting advertisements to the mobile device;     -   basing the advertisement on the location of the mobile device         and/or owner of the mobile device;     -   basing the advertisement on the location of the display device         and nature of business within which the display device is         located;     -   alternating the display device source from a delayed source to         an non-delayed source and back to a delayed source;     -   transmitting delayed audio to the display device;     -   using the mobile device to remotely control the display device;         and     -   using the mobile device to generate messages for display on the         display device.

P11. An external audio/video processor for use in combination with one or more local audio/video processors, wherein the external audio/video processor processes audio/video signals transferred from the local audio/video processor(s) and returns processed audio and/or video information back to the local audio/video processor(s) for distribution to one or more mobile devices and one or more display devices as described herein.

P12. A system, apparatus, and method in which the mobile device receives audio from a source different than the source of the video provided for display on the display device and synchronizes the audio to the video.

P13. A system, apparatus, and method in which the A/V source provides audio (with or without video) to the mobile device and provides delayed video (with or without delayed audio) for display on the display device, such that the mobile device can synchronize the audio it receives with the video displayed on the display device.

P14. A system, apparatus, and method in which the A/V source provides an audio (with or without video) signal to the mobile device and provides a video (with or without audio) signal for display on the display device, wherein at least one of the audio signal and the video signal includes synchronization codes, such that the mobile device and/or a device that processes the video signal can synchronize the audio played by the mobile device with the video displayed on the display device.

P15. Any of the methods for selecting or changing the audio channel at the mobile device as discussed above.

P16. Apparatus for distributing a source audio/video signal containing video information and corresponding audio information, the apparatus comprising:

a first communication interface couplable to a first communication network for sending at least the audio information from the source audio/video signal to be played by a number of mobile devices;

a second communication interface couplable to a second communication network for sending at least the video information from the source audio/video signal to be displayed on a video display device; and

an audio/video processor configured to transmit first signals containing audio information from the source audio/video signal over the first communication network via the first communication interface and to transmit second signals containing video information from the source audio/video signal over the second communication network via the second communication interface with the transmitted video information delayed relative to the transmitted audio information by a predetermined amount of delay sufficient to ensure that the audio information arrives at the mobile devices before the video information is displayed on the video display device, whereby each mobile device can independently adjust the amount of delay it adds to the received audio information to synchronize audio played by the mobile device with the video displayed on the video display device.

P17. Apparatus according to claim P16, wherein the second signals comprise the source audio/video signal delayed relative to the transmitted audio information.

P18. Apparatus according to claim P16, wherein the first communication network comprises the Internet.

P19. Apparatus according to claim P16, wherein the second communication network comprises a cable television network.

P20. Apparatus according to claim P16, wherein the second communication network comprises a satellite television network.

P21. Apparatus according to claim P16, further comprising a converter coupled to the second communication network for receiving the second signals and displaying the video information on the video display device.

P22. A method of distributing a source audio/video signal containing video information and corresponding audio information, the method comprising:

transmitting first signals containing audio information from the source audio/video signal over a first communication network to be played by a number of mobile devices; and

transmitting second signals containing video information from the source audio/video signal over a second communication network to be displayed on a video display device, the transmitted video information delayed relative to the transmitted audio information by a predetermined amount of delay sufficient to ensure that the audio information arrives at the mobile devices before the video information is displayed on the video display device, whereby each mobile device can independently adjust the amount of delay it adds to the received audio information to synchronize audio played by the mobile device with the video displayed on the video display device.

P23. A method according to claim P22, wherein the second signals comprise the source audio/video signal delayed relative to the transmitted audio information.

P24. A method according to claim P22, wherein the first communication network comprises the Internet.

P25. A method according to claim P22, wherein the second communication network comprises a cable television network.

P26. A method according to claim P22, wherein the second communication network comprises a satellite television network.

P27. A method for distributing an audio/video signal by an audio/video processor, the method comprising:

receiving, by the audio/video processor, a combined audio/video signal in which video information associated with a source audio/video signal is delayed from corresponding audio information associated with the source audio/video signal;

transmitting, by the audio/video processor, at least the audio information from the combined audio/video signal to at least one mobile device over a communication system; and

transmitting, by the audio/video processor, at least the video information from the combined audio/video signal to a display device separate from the at least one mobile device, so that each mobile device can independently adjust the amount of delay it adds to the received audio information to synchronize audio played by the mobile device with the video displayed on the display device.

P28. A method according to claim P27, wherein transmitting at least the audio information from the combined audio/video signal comprises:

decoding the combined audio/video signal to obtain the audio information.

P29. A method according to claim P27, wherein transmitting at least the video information from the combined audio/video signal comprises decoding the combined audio/video signal to obtain the video information.

P30. An audio/video processing apparatus for distributing an audio/video signal to allow for synchronization of video and audio information, the apparatus comprising:

an audio/video signal receiver;

a communication interface for communication with at least one mobile device;

a display device interface for communication with at least one display device; and

an audio/video processor coupled to the audio/video signal receiver, the communication interface and the display device interface, the audio/video processor configured to receive a combined audio/video signal in which video information associated with a source audio/video signal is delayed from corresponding audio information associated with the source audio/video signal, transmit at least the audio information from the combined audio/video signal to at least one mobile device over a communication system, and transmit at least the video information from the combined audio/video signal to a display device separate from the at least one mobile device, so that each mobile device can independently adjust the amount of delay it adds to the received audio information to synchronize audio played by the mobile device with the video displayed on the display device.

P31. An audio/video processing apparatus according to claim P30, wherein the audio/video processor is further configured to decode the combined audio/video signal to obtain the audio information.

P32. An audio/video processing apparatus according to claim P30, wherein the audio/video processor is further configured to decode the combined audio/video signal to obtain the video information.

P33. An audio/video distribution system comprising:

a first audio/video processing apparatus configured to transmit a combined audio/video signal in which video information associated with a source audio/video signal is delayed from corresponding audio information associated with the source audio/video signal; and

a second audio/video processing apparatus configured to receive the combined audio/video signal, transmit at least the audio information from the combined audio/video signal to at least one mobile device over a communication system, and transmit at least the video information from the combined audio/video signal to a display device separate from the at least one mobile device, so that each mobile device can independently adjust the amount of delay it adds to the received audio information to synchronize audio played by the mobile device with the video displayed on the display device.

P34. A system according to claim P33, wherein the source audio/video signal is generated internal to the first audio/video processing apparatus.

P35. A system according to claim P33, wherein the source audio/video signal is received by the first audio/video processing apparatus from an external audio/video signal source.

P36. A system according to claim P33, wherein the source audio/video signal is received by the first audio/video processing apparatus from the second audio/video processing apparatus.

P37. A method of distributing an audio/video signal by an audio/video processing apparatus, the method comprising:

receiving a source audio/video signal having video information and corresponding audio information;

producing from the source audio/video signal a combined audio/video signal in which the video information associated with the source audio/video signal is delayed from the corresponding audio information associated with the source audio/video signal; and

transmitting the combined audio/video signal to an audio/video processing apparatus.

P38. A method according to claim P37, wherein the source audio/video signal is received from the audio/video processing apparatus.

P39. A method according to claim P37, wherein the source audio/video signal is produced by the audio/video source.

P40. A method according to claim P37, wherein the combined audio/video signal further includes additional information that is not in the source audio/video signal.

P41. A method according to claim P40, wherein the additional information includes an advertisement.

P42. A method according to claim P41, wherein an advertisement is based on at least one of:

the location of the mobile device;

the identity of the mobile device;

the identity of the mobile device user;

the content being displayed on the display device; and

the nature of the location of the display device.

P43. A method according to claim P41, wherein the additional information includes a device identifier, the device identifier enabling mobile device users to identify the display device for selectively receiving the associated audio information.

P44. An audio/video source comprising an audio/video processor configured to receive a source audio/video signal having video information and corresponding audio information, produce from the source audio/video signal a combined audio/video signal in which the video information associated with the source audio/video signal is delayed from the corresponding audio information associated with the source audio/video signal, and transmit the combined audio/video signal to an audio/video processing apparatus.

P45. An audio/video source according to claim P44, wherein the source audio/video signal is received from the audio/video processing apparatus.

P46. An audio/video source according to claim P44, wherein the source audio/video signal is produced by the audio/video source.

P47. An audio/video source according to claim P44, wherein the combined audio/video signal further includes additional information that is not in the source audio/video signal.

P48. An audio/video source according to claim P47, wherein the additional information includes an advertisement.

P49. An audio/video source according to claim P48, wherein an advertisement is based on at least one of:

the location of the mobile device;

the identity of the mobile device;

the identity of the mobile device user;

the content being displayed on the display device; and

the nature of the location of the display device.

P50. An audio/video source according to claim P47, wherein the additional information includes a device identifier, the device identifier enabling mobile device users to identify the display device for selectively receiving the associated audio information.

P51. A mobile communication device comprising:

an audio output device;

a wireless communication interface; and

an audio processor configured to receive audio information associated with a source audio/video signal via the wireless communication interface, communicate with an audio/video processor via the wireless communication interface to receive synchronization information for synchronizing the received audio information with corresponding video information from the source audio/video signal displayed on a video display device; add delay to the received audio information in accordance with the received synchronization information, and play the delayed audio information via the audio output device.

P52. An audio distribution device comprising:

an audio interface for receiving audio information from an audio source;

an advertisement interface for receiving video advertisements from an advertisement source;

a communication interface for communicating with at least one mobile device over a wireless communication system; and

an audio/video processor configured to transmit, to each of a number of mobile devices, audio information to be played through an audio output of the mobile device and video advertisements to be displayed on a display device of the mobile device during such playing of the audio information.

P53. The audio distribution device of claim P52, wherein the audio source is internal to the audio distribution device.

P54. The audio distribution device of claim P52, wherein the audio source is external to the audio distribution device.

P55. The audio distribution device of claim P52, wherein the advertisement source is internal to the audio distribution device.

P56. The audio distribution device of claim P52, wherein the advertisement source is external to the audio distribution device.

P57. The audio distribution device of claim P52, wherein the audio/video processor is configured to select the video advertisements based on at least one of

a location of the audio distribution device; or

a type of business associated with the location of the audio distribution device.

P58. The audio distribution device of claim P52, wherein the audio/video processor is configured to select video advertisements for each mobile device.

P59. The audio distribution device of claim P58, wherein the audio/video processor is configured to select the video advertisements for a given mobile device based on at least one of

a location of the mobile device;

an identity of the mobile device;

an identity of a user of the mobile device; or

the audio information.

P60. The audio distribution device of claim P52, further comprising a display device interface for communication with a display device separate from the mobile devices.

P61. The audio distribution device of claim P60, wherein the audio/video processor is configured to display video advertisements on such separate display device via the display device interface.

P62. The audio distribution device of claim P60, wherein the audio information is derived from an audio/video signal, and wherein the audio/video processor is configured to display video information from the audio/video signal on such separate display device via the display device interface.

P63. The audio distribution device of claim P62, wherein the audio/video processor is configured to delay the video information by a predetermined amount of delay greater than an amount of delay added to the audio information by the wireless communication system, so that the audio information is received by the mobile devices at or before the video information is displayed on the display device and each mobile device can independently adjust the amount of delay it adds to the received audio information to synchronize audio played by the mobile device with the video displayed on the display device.

The present invention may be embodied in other specific forms without departing from the true scope of the invention. Any references to the “invention” are intended to refer to exemplary embodiments of the invention and should not be construed to refer to all embodiments of the invention unless the context otherwise requires. The described embodiments are to be considered in all respects only as illustrative and not restrictive. 

What is claimed is:
 1. A computer program product comprising a tangible, non-transitory computer-readable medium having embodied therein computer program instructions that, when run on a mobile communication device of a user, cause the mobile communication device to execute a method for controlling audio associated with a delayed video displayed on a display device separate from the mobile communication device, the method comprising: a. receiving an audio signal from a processing unit over a wireless communication system, the audio signal containing packets of digitized audio information corresponding to delayed video information displayed on a display screen of a display device separate from the mobile communication device, wherein the audio information is received at the mobile communication device prior to the corresponding delayed video information being displayed on the display screen of the display device; b. performing a synchronization protocol with the processing unit to analyze audio delay incurred from the wireless communication system; c. determining, based on the incurred audio delay, an automatic delay adjustment for adjusting an amount of delay to be introduced by the mobile communication device when outputting the audio information for synchronizing output of the audio information with the delayed video information displayed on the display screen of the display device; d. providing a user interface to the user including an audio delay control allowing the user to provide a manual delay adjustment for adjusting the amount of delay to be introduced by the mobile communication device when outputting the audio information for synchronizing output of the audio information with the delayed video information displayed on the display screen of the display device; and e. outputting delayed audio information via an audio interface of the mobile communication device with a total amount of delay added to the audio information based on the automatic delay adjustment and the manual delay adjustment.
 2. The computer program product of claim 1, wherein the audio delay control allows the user to incrementally adjust the amount of manual delay adjustment.
 3. The computer program product of claim 1, wherein the method further comprises: receiving advertisements over the wireless communication system; and displaying the advertisements on a display device of the mobile communication device.
 4. The computer program product of claim 3, wherein the advertisements are selected based on at least one of: a location of the mobile device; an identity of the mobile device; an identity of a user of the mobile device; the video information being displayed on the display screen of the display device; or the audio information.
 5. The computer program product of claim 1, wherein the method further comprises: capturing an image of at least a portion of the display screen using an image capture device of the mobile communication device; and transmitting, to a server, information derived from the captured image for identifying an audio channel associated with the image from among a plurality of audio channels, wherein the audio information received by the mobile communication device is associated with the identified audio channel.
 6. The computer program product of claim 1, wherein the method further comprises: capturing an image associated with a display device from among a plurality of display devices using an image capture device of the mobile communication device; and transmitting, to a server, information derived from the captured image for identifying the display device, wherein the audio information received by the mobile communication device is associated with the identified display device.
 7. A mobile communication device comprising: a wireless communication interface; an audio interface; and an audio processor application for controlling audio associated with a delayed video displayed on a display device separate from the mobile communication device, the audio processor application configured to: a. receive an audio signal from a processing unit over the wireless communication interface, the audio signal containing packets of digitized audio information corresponding to delayed video information displayed on a display screen of a display device separate from the mobile communication device, wherein the audio information is received at the mobile communication device prior to the corresponding delayed video information being displayed on the display screen of the display device; b. performing a synchronization protocol with the processing unit to analyze audio delay incurred from the wireless communication system; c. determining, based on the incurred audio delay, an automatic delay adjustment for adjusting an amount of delay to be introduced by the mobile communication device when outputting the audio information for synchronizing output of the audio information with the delayed video information displayed on the display screen of the display device; d. provide a user interface to the user including an audio delay control allowing the user to provide a manual delay adjustment for adjusting the amount of delay to be introduced by the audio processor application when outputting the audio information for synchronizing output of the audio information with the delayed video information displayed on the display screen of the display device; and e. output delayed audio information via the audio interface of the mobile communication device with a total amount of delay added to the audio information based on the automatic delay adjustment and the manual delay adjustment.
 8. The mobile communication device of claim 7, wherein the audio delay control allows the user to incrementally adjust the amount of manual delay adjustment.
 9. The mobile communication device of claim 7, wherein the audio processor application is further configured to: receive advertisements over the wireless communication system; and display the advertisements on a display device of the mobile communication device.
 10. The mobile communication device of claim 9, wherein the advertisements are selected based on at least one of: a location of the mobile device; an identity of the mobile device; an identity of a user of the mobile device; the video information being displayed on the display screen of the display device; or the audio information.
 11. The mobile communication device of claim 7, further comprising an image capture device, wherein the audio processor application is further configured to capture an image of at least a portion of the display screen using the image capture device and to transmit, to a server, information derived from the captured image for identifying an audio channel associated with the image from among a plurality of audio channels, wherein the audio information received by the mobile communication device is associated with the identified audio channel.
 12. The mobile communication device of claim 7, further comprising an image capture device, wherein the audio processor application is further configured to capture an image associated with a display device from among a plurality of display devices using the image capture device and to transmit, to a server, information derived from the captured image for identifying the display device, wherein the audio information received by the mobile communication device is associated with the identified display device.
 13. The computer program product of claim 1, wherein the user interface allows the user to select from a variety of audio sources in order to receive the audio information corresponding to the delayed video information displayed on the display screen of the display device.
 14. The computer program product of claim 13, wherein the mobile communication device is configured to determine available audio sources based on location of the mobile communication device.
 15. The computer program product of claim 14, wherein the location of the mobile communication device is determined based on proximity to an identifiable wireless transceiver.
 16. The computer program product of claim 14, wherein the location of the mobile communication device is determined based on proximity to the processing unit.
 17. The computer program product of claim 14, wherein the location of the mobile communication device is determined based on GPS information.
 18. The mobile communication device of claim 7, wherein the user interface allows the user to select from a variety of audio sources in order to receive the audio information corresponding to the delayed video information displayed on the display screen of the display device.
 19. The mobile communication device of claim 18, wherein the mobile communication device is configured to determine available audio sources based on location of the mobile communication device.
 20. The mobile communication device of claim 19, wherein the location of the mobile communication device is determined based on proximity to an identifiable wireless transceiver.
 21. The mobile communication device of claim 19, wherein the location of the mobile communication device is determined based on proximity to the processing unit.
 22. The mobile communication device of claim 19, wherein the location of the mobile communication device is determined based on GPS information. 